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83281 PATENT 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 
In Re U.S. Patent Application ) 
Applicant: Infineon Technologies ) 
Serial No.: Not Yet Assigned ) 
Filed: Herewith ) 

For: METHOD AND DEVICE FOR ) 

CALCULATING A DISCRETE \ 

ORTHOGONAL TRANFORMATION \ 

SUCH AS FFT OR IFFT > 

PRELIMINARY AMENDMENT 

Assistant Commissioner for Patents 
Washington, D.C. 20231 

Sir: 

This is a Preliminary Amendment for entry in the above-identified application. 
In the Claims : 

Please amend the claims as follows: 

1 . (amended) A method for calculating an orthogonal discrete transform on the basis 
of the DIT method in prescribed intermediate steps, 
wherein 

a) the data are read from a memory organized on a page-for-page basis, 

b) the intermediate step prescribed by the transform is carried out, 

c) the resulting data are stored in a buffer memory, and 

d) the resulting data are written page-for-page from the buffer memory 
to the memory organized on a page-for-page basis. 



2. (amended) A method for calculating an orthogonal discrete transform on the basis 
of the DIF method in prescribed intermediate steps, 

wherein 

a) the data are read from a memory organized on a page-for-page basis, 

b) the data are stored in a buffer memory, 

c) the intermediate step prescribed by the transform is carried out, and 

d) the resulting data are written page-for-page from the buffer memory 
to the memory organized on a page-for-page basis. 

3 . (amended) A method for calculating an orthogonal discrete transform on the basis 
of the DIF method in prescribed intermediate steps, 

wherein 

the data are read from two memories organized on a page-for-page basis, such 
that the reading is organized on a page-for-page basis, the intermediate step prescribed by the 
transform is carried out, and 

the resulting data are again written page-for-page to the two memories organized 
on a page-for-page basis. 

4. (amended) The method as claimed in claim 1, wherein the discrete orthogonal 
transform is formed by an FFT, IFFT, DCT or IDCT. 
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7. (amended) An apparatus for carrying out the method as claimed in claim 1 
wherein 

the apparatus has a memory organized on a page-for-page basis, a processor and 
a directly organized memory which is arranged downstream of the processor. 

8. (amended) An apparatus for carrying out the method as claimed in claim 1 
wherein 

the apparatus has a memory organized on a page-for-page basis, a processor and 
a directly organized memory which is arranged upstream of the processor. 

9. (amended) The apparatus as claimed in claim 7, wherein the page-oriented 
memory is a large memory in relation to the directly organized buffer memory. 

10. (amended) The apparatus as claimed in claim 9, wherein a fast memory is used 
for the buffer memory. 

11. (amended) The apparatus as claimed in claim 7, wherein the page-oriented 
memory is a dram and buffer memory is an SRAM. 

12. (amended) The apparatus as claimed in claim 7, wherein the page-oriented 
memory has a size of 8 K addresses and the buffer memory has a size of 32 - 64 addresses. 



3 



13. (amended) An apparatus for carrying out the method as claimed in claim 3 
wherein 

the apparatus has two memories organized on a page-for-page basis and a processor. 



14. (amended) The apparatus as claimed in claim 13, wherein the page-oriented 
memories are of the same size. 

15 . (amended) The apparatus as claim in claim 14, wherein the page-oriented memory 
has a size of 4 K addresses. 

16. (amended) The apparatus as claimed in claim 7, wherein the processor provides 
a Butterfly. 

IN THE SPECIFICATION: 



Dated: July 18, 2001 

Welsh & Katz, Ltd. 

120 South Riverside Plaza, 22nd Floor 
Chicago, Illinois 60606 
Telephone: 312/655-1500 



Please add the attached Abstract to the specification. 



Respectfully submitted, 



WELSH & KATZ, LTD. 




Gerald T. Shekleton 
Registration No. 27,466 
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VERSION WITH MARKINGS TO SHOW CHANGES MADE 

1. (amended) A method for calculating an orthogonal discrete transform on the 
basis of the DIT method in prescribed intermediate steps, 

wherein 

a) the data are read from a memory [(1)] organized on a page-for-page basis, 

b) the intermediate step prescribed by the transform is carried out, 

c) the resulting data are stored in a buffer memory [(5)], and 

d) the resulting data are written page-for-page from the buffer memory [(5)] 
to the memory [(1)] organized on a page-for-page basis. 

2. (amended) A method for calculating an orthogonal discrete transform on the 
basis of the DIF method in prescribed intermediate steps, 

wherein 

a) the data are read from a memory [(1)] organized on a page-for-page basis, 

b) the data are stored in a buffer memory [(5)], 

c) the intermediate step prescribed by the transform is carried out, and 

d) the resulting data are written page-for-page from the buffer memory [(5)] 
to the memory [(1)] organized on a page-for-page basis. 

3. (amended) A method for calculating an orthogonal discrete transform on the 
basis of the DIF method in prescribed intermediate steps, 

wherein 
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the data are read from two memories [(3, 4)] organized on a page-for-page basis, 
such that the reading is organized on a page-for-page basis, the intermediate step prescribed by 
the transform is carried out, and 

the resulting data are again written page-for-page to the two memories [(3, 4)] 
organized on a page-for-page basis. 

4. (amended) The method as claimed in [one of claims 1 to 3] claim 1, wherein 
the discrete orthogonal transform is formed by an FFT, IFFT, DCT or IDCT. 

*2 7. (amended) An apparatus for carrying out the method as claimed in [one of 

IB 

m claims 1, 4, 6] claim 1 

if 5 

Lfl wherein 

lo the apparatus has a memory [(1)] organized on a page-for-page basis, a processor 

U r( 2 )] and a directly organized memory [(5)] which is arranged downstream of the 

3158!, 

fj? processor. 

H 

3s sis 

8. (amended) An apparatus for carrying out the method as claimed in [one of 

claims 1, 4, 6] claim 1 
wherein 

the apparatus has a memory [(1)] organized on a page-for-page basis, a processor 
[(2)] and a directly organized memory [(5)] which is arranged upstream of the 

processor. 
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9. (amended) The apparatus as claimed in [one of claims 7 or 8] claim 7, wherein 
the page-oriented memory [(1)] is a large memory in relation to the directly organized buffer 
memory [(5)]. 

10. (amended) The apparatus as claimed in claim 9, wherein a fast memory is used 
for the buffer memory [(5)]. 

11. (amended) The apparatus as claimed in [one of claims 7-10] claim 7, wherein 
the page-oriented memory [(1)] is a dram and buffer memory [(5)] is an SRAM. 

12. (amended) The apparatus as claimed in [one of claims 7-11] claim 7, wherein 
the page-oriented memory [(1)] has a size of 8 K addresses and the buffer memory [(5)] has a 
size of 32 - 64 addresses. 

13. (amended) An apparatus for carrying out the method as claimed in [one of 
claims 3, 4-6] claim 3 

wherein 

the apparatus has two memories [(3, 4)] organized on a page-for-page basis and 
a processor [(2)]. 

14. (amended) The apparatus as claimed in claim 13, wherein the page-oriented 
memories [(3, 4)] are of the same size. 



15. (amended) The apparatus as claim in claim 14, wherein the page-oriented 
memory [(3, 4)] has a size of 4 K addresses. 

16. (amended) The apparatus as claimed in [one of claims 7-15] claim 7 , wherein 
the processor [(2)] provides a Butterfly. 
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Description 

Method and apparatus for calculating a discrete^ 
orthogonal transform such as FFT or IFFT 

5 

The invention relates to a method and a circuit for 
calculating the fast Fourier transform and also the 
inverse transform thereof, called FFT and IFFT below, 
or transforms of similar structure, i.e. discrete 
10 orthogonal transforms. 

The FFT and IFFT are extremely important transforms in 
communications technology because, by way of example, 
they transform the description of a factual basis in 
15 the time domain into a factual basis in the frequency 
domain, and vice-versa. 



In digital signal processing, an 1 N point 1 discrete 
Fourier transform, called DFT below, is frequently 
20 calculated which is defined as follows: 

X(k) = £ x(n) W nk k = 0, 1, , N-l 

r.=0 

where w = e ^ j)27C/N 

25 

The complexity of calculating the DFT is proportional 
to 0(N 2 ). By using the FFT, it is possible to reduce the 
complexity of the calculation to 0(Nlog(N)). This is 
done by hierarchical splitting of the calculation into 
30 transforms having shorter successions. 

To calculate the FFT, there are two basic algorithms. 
One is called "Decimation in Frequency" (DIF) and the 
other is called "Decimation in Time" (DIT) . The text 
35 below deals with the DIT algorithm by way of example. 
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To calculate the FFT, the "in-place" variant is 
preferably used in which calculated intermediate 
results of the Butterfly calculation are written to the 
same memory, from where they are in turn read and put 
5 to further use, as shown in figure 1. This provides for 
particularly economical use of the memory. 

Figure 2 shows the calculation operation of the "in- 
place" variant for N - 8 in the form of a signal 

10 flowchart. As can be seen from figure 2, at the start 
of the calculation, the memory needs to contain the 
data in a particular arrangement, usually referred to 
as "bit-reversed" . At the end of calculation, the 
result can be read linearly. The calculation itself is 

15 performed in a plurality of stages, as shown in the 
signal flowchart in figure 2. In the cited example, 
three stages are necessary. In each case, two data 
items are read from the memory, the Butterfly is then 
calculated, and the two results are written back again 

20 to the same locations in the memory. In this context, 
the data items are not necessarily situated in adjacent 
memory locations, however. In addition, the calculation 
differs from one stage to the next. 

25 If the FFT is implemented in integrated circuits, the 
complexity is primarily determined by the memory used. 
In this context, large memories are usually of page- 
for-page design, which means that access to a memory 
cell within such a page is very fast; by way of 

30 example, such a memory access operation can be carried 
out within one clock cycle. Changing from one page to 
another takes significantly longer, however, i.e. a 
plurality of clock cycles. It is possible to increase 
the throughput in page-oriented memories by processing 

35 a page of a memory as completely as possible and only 
then changing the page again, because addresses of the 
other page are required. In the case of the 
aforementioned "in-place" FFT, however, the data are, 
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in principle, required in very unorganized form. Small 
memories do not have this drawback, since access to the 
individual cells is possible without restriction in 
this case. 

5 

The speed of the FFT is therefore primarily limited by 
the interface to the memory. 

The invention is therefore based on the object of 
10 providing a method and an apparatus for calculating 
discrete orthogonal transforms, in particular the FFT 
and I FFT, which permit faster calculation. 

The object is achieved by the features of the 
15 independent claims. Preferred refinements of the 
invention are the subject matter of the subclaims. 



The first inventive method for calculating an 
orthogonal discrete transform on the basis of the DIT 
20 method in prescribed intermediate steps, where an 
intermediate step is to be understood as meaning the 
addition and multiplication of a processing stage, 
comprises the following steps: 

a) the data are read from a memory organized on a page- 
25 for-page basis, 

b) the intermediate step prescribed by the transform is 
carried out, 

c) the resulting data are stored in a buffer memory, 
and 

30 d) the resulting data are written page-f or-page from 
the buffer memory to the memory organized on a page- 
f or-page basis. 



The inventive method for calculating an orthogonal 
35 discrete transform on the basis of the DIF method in 
prescribed intermediate steps has the following steps: 
a) the data are read from a memory organized on a page- 
for-page basis, 
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b) the data are stored in a buffer memory, 

c) the intermediate step prescribed by the transform is 
carried out, and 

d) the resulting data are written page-f or-page from 
5 the buffer memory to the memory organized on a page- 

f or-page basis. 



A third inventive method for calculating an orthogonal 
discrete transform in prescribed intermediate steps has 
10 the following steps: 

a) the data are read from two memories organized on a 
page-f or-page basis, so that the reading is 
organized on a page-f or-page basis, 

b) the intermediate step prescribed by the transform is 
15 carried out, and 

c) the resulting data are written page-f or-page to the 
two memories organized on a page-f or-page basis. 

Examples of a suitable discrete orthogonal transform 
20 are an FFT, an 1FFT, a DCT or an IDCT and also 
transforms organized on a schematically similar basis. 



Preferably, the transform has an identical geometry for 
each stage, which facilitates the addressing of the 
25 results. By way of example, this condition is satisfied 
for an FFT or an I FFT based on Singleton. 



In accordance with the above methods, the inventive 
apparatuses have 
30 a memory organized on a page-f or-page basis, a 
processor and a directly organized memory which is 
arranged downstream of the processor, or 

a memory organized on a page-f or-page basis, a 
processor and a directly organized buffer memory which 
35 is arranged upstream of the processor, or 

two memories organized on a page-f or-page basis and a 
processor . 
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In this context, the processor provides a "Butterfly". 

Preferred embodiments of the invention are explained 
below with reference to the drawings. 

Figure 1 shows the diagram for calculation of an "in 
place" FFT, 



Figure 2 shows the signal flowchart for an "in place" 
10 FFT for N = 8, 

Figure 3 shows the signal flowchart for a "Singleton" 
FFT for N - 8, 

15 Figure 4 shows a first embodiment for calculating an 
FFT , and 

Figure 5 shows a second embodiment for calculating an 
FFT on the basis of the DIT method. 



20 



25 



Figure 1 shows the aforementioned usual "in-place" 
calculation of an FFT, where a data pair is read from a 
memory 1, normally a DRAM, is logically combined in a 
Butterfly unit 2 and is written to the memory 1 again. 



Figure 2 shows the basic DIT algorithm for an FFT 
calculation for N = 8. For calculation, the data items 
x(0) to x(7) need to be present in the memory in "bit- 
reversed" fashion at the start. The figure shows the 

30 signal flowchart for calculating the Fourier components 
X{0) - X(7). The algorithm and the signal flowchart are 
described in A.V. Oppenheim, R.W. Schafer: "Digital 
Signal Processing", Prentice-Hall Inc., Englewood 
Cliffs, New Jersey, USA, 1975, pp. 285 ff., so that 

35 there is no need for this to be explained in more 
detail here. With regard to the nomenclature, the 
factors W°, w 1 and W 2 are also called twiddle factors. 
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Figure 3 shows an FFT algorithm with an altered 
sequence of computation operations, the "Singleton 
algorithm", which is described on page 301 of the 
aforementioned literature reference Oppenheim-Schaf er 
5 and is used in the method according to the invention. 
It is clear to see that the data are processed in very 
regular fashion, but no "in place" processing is 
carried out. Twice the memory space as compared with 
the "in place" calculation is therefore required. 

10 

Advantageously, the calculation has entirely the same 
structure in each stage, with the exception of the 
twiddle factors. This means that the stages of the 
illustrative algorithm for an FFT for N - 8 have an 
15 identical geometry. When a stage has been processed, 
the last memory written to is read and another memory 
is written to. 

The memory can be read linearly. Linear writing is not 
20 yet possible in the algorithm, however, since it is 
always necessary to store one result in the top half 
(solid line) and one result in the bottom half (dashed 
line) . However, linear writing is possible in the top 
and bottom halves. 

25 

Figure 4 shows a schematic illustration of a first 
inventive embodiment for rapidly calculating an 
algorithm which has a similar or the same design as the 
Singleton algorithm shown in figure 3. [lacuna] Such an 

30 apparatus, which is suitable both for DIF and DIT 
calculation, the memory required for the calculation is 
subdivided into two page-oriented memories 3, 4 which 
are of the same size and are in the form of DRAMs (DRAM 
= Dynamic RAM) . The use of two page-oriented memories 1 

35 and 2 means that each memory can inherently be written 
to in linear fashion, i.e. a respective memory is used 
for the top and bottom halves of the algorithm shown in 
figure 3, so that the number of slow page changes is 
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small. The number of memories and the number of 
interfaces are doubled, however. 

A second embodiment, shown in figure 5, for rapidly 
5 calculating a DIT transform like that in figure 3 is 
the use of a small fast memory 5 arranged downstream of 
the Butterfly 2. This memory buffer-stores a few 
intermediate results of the calculation so that it can 
then write them to the page-oriented memory 1 without 

10 constantly changing page. The course of the calculation 
is now that two data items are read from the memory 1, 
are supplied to the Butterfly, and the intermediate 
results are stored in the fast SRAM memory 5 (SRAM = 
Static RAM) . The intermediate results are then written 

15 page-f or-page to the page-oriented memory 1. 

In the case of calculation on the basis of the DIF 
method, the static fast memory 5 is situated at the 
input of the Butterfly 2 (not shown) . In other 
20 respects, the manner of operation is similar to that of 
the DIT variant explained. 

The table 1 below illustrates the addressing scheme for 
calculation using buffer-storage: 

25 

Table 1 
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In the example, the page-oriented memory has a depth of 
N = 32, which means that an FFT having 32 points can be 
calculated. A page of the slow page-oriented memory may 
5 have P = 4 addresses, for example, and the fast memory 
used may have a size of S = 4-P = 16 addresses. The data 
can be read linearly in the order 0, 1, 2, ... By way 
of example, for the purposes of processing, the 
Butterfly reads (RD) the data for the DRAM addresses 0 
10 and 1. The intermediate results of the Butterfly 
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produce addresses 0 and 16. These are written (WR) to 
the addresses 0 and 8 of the fast buffer memory SRAM. 
With a delay which is required for partly filling the 
memory, the data are read page-f or-page in linear 
5 fashion, that is to say, in table 1, under SRAM (RD) 
the addresses 0-3, then 8-11 etc. The content of these 
SRAM addresses is written page-f or-page in linear 
fashion to the appropriate addresses, which are 
conditional on the transform, of the slow page-oriented 
10 memory (column DRAM (WR) ) , that is to say on the basis 
of 0-3, 16-19, 4-7 etc., in the example. The use of the 
small fast memory therefore optimizes access to the 
page-oriented memory DRAM such that the slow page 
changes required are as few as possible. 



The transforms and memory sizes mentioned above and 
explained serve only as an illustration. In practice, 
the slow memory is much larger than the fast memory. By 
way of example, the slow memory is normally designed 

20 for N = 8192, with the slow memory having a page size 
of P = 16. The small memory therefore has a size of 64 
addresses. In an integrated implementation, the fast 
small memory is therefore of barely any consequence in 
terms of surface area, but calculation of the FFT or 

25 IFFT or of similar discrete orthogonal transforms is 
significantly speeded up on account of the minimization 
of the page changes in the slow memory. 
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Patent claims 



1 . A method for calculating an orthogonal discrete 
transform on the basis of the DIT method in 

5 prescribed intermediate steps, 

wherein 

a) the data are read from a memory (1) organized 
on a page-f or-page basis, 

b) the intermediate step prescribed by the 
10 transform is carried out, 

c) the resulting data are stored in a buffer 
memory (5) , and 

d) the resulting data are written page-f or-page 
from the buffer memory (5) to the memory (1) 

15 organized on a page-f or-page basis. 

2. A method for calculating an orthogonal discrete 
transform on the basis of the DIF method in 
prescribed intermediate steps, 

20 wherein 

a) the data are read from a memory (1) organized 
on a page-f or-page basis, 

b) the data are stored in a buffer memory (5) , 

c) the intermediate step prescribed by the 
25 transform is carried out, and 

d) the resulting data are written page-f or-page to 
the memory (1) organized on a page-f or-page 
basis . 



30 3. A method for calculating an orthogonal discrete 
transform in prescribed intermediate steps, 
wherein 

the data are read from two memories (3, 4) 
organized on a page-f or-page basis such that the 
35 reading is organized on a page-f or-page basis, 

the intermediate step prescribed by the transform 
is carried out, and 



the resulting data are again written page-f or-page 
to the two memories (3, 4) organized on a page- 
f or-page basis. 



The method as claimed in one of claims 1 to 3, 
wherein the discrete orthogonal transform is 
formed by an FFT, IFFT, DCT or IDCT. 



The method as claimed in claim 4, wherein the 
transform has an identical geometry for each 
stage . 



The method as claimed in claim 5, wherein an FFT 
or IFFT based on Singleton is used. 



An apparatus for carrying out the method as 

claimed in one of claims 1, 4-6, 

wherein 

the apparatus has a memory (1) organized on a 
page-f or-page basis, a processor (2) and a 
directly organized memory (5) which is arranged 
downstream of the processor. 



An apparatus for carrying out the method as 

claimed in one of claims 2, 4-6, 

wherein 

the apparatus has a memory (1) organized on a 
page-f or-page basis, a processor (2) and a 
directly organized buffer memory (5) which is 
arranged upstream of the processor. 



The apparatus as claimed in one of claims 7 or 8 , 
wherein the page-oriented memory (1) is a large 
memory in relation to the directly organized 
buffer memory (5) . 



The apparatus as claimed in claim 9, wherein a 
fast memory is used for the buffer memory (5) . 
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11. The apparatus as claimed in one of claims 7-10, 
wherein the page-oriented memory (1) is a DRAM and 
the buffer memory (5) is an SRAM. 

5 

12. The apparatus as claimed in one of claims 7 - 11, 
wherein the page-oriented memory (1) has a size of 
8 K addresses and the buffer memory (5) has a size 
of 32 - 64 addresses. 

10 

13. An apparatus for carrying out the method as 
claimed in one of claims 3, 4-6, 

wherein 

the apparatus has two memories (3, 4) organized on 
15 a page-f or-page basis and a processor (2) . 

14. The apparatus as claimed in claim 13, wherein the 
page-oriented memories (3, 4) are of the same 
size . 

20 

15 . The apparatus as claimed in claim 14 , wherein the 
page-oriented memory (3, 4) has a size of 4 K 
addresses . 



25 16. 



The apparatus as claimed 
wherein the processor (2) 



in one of claims 7 - 
provides a Butterfly. 
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